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SECTION  I 


INTRODUCTION 


The  performance  of  computer  systems  is  usually  evaluated  for 
the  purposes  of  determining  its  present  value,  methods  of  improving 
it  and  predicting  the  effects  of  changes  in  either  the  workload  or 
the  system.  One  method  of  studying  these  factors  is  to  conduct 
experiments  with  an  existing  system  using  a test  workload.  This 
report  describes  the  results  of  and  experiences  from  an  experimental 
investigation  with  a Burroughs  B 3500  computer  system  at  ESD,  Hanscom 
Air  Force  Base. 

In  order  to  conduct  these  experiments  there  is  a need  for  a 
stable,  reproducible  and  flexible  test  workload  that  imitates  the 
real  workload  with  reasonable  fidelity  but  in  an  abbreviated  form. 

It  is  necessary  that  the  test  workload  be  stable  and  reproducible 
so  that  the  experimental  results  are  interpreted  correctly;  the  test 
workload  be  flexible  so  that  the  characteristics  of  the  test  work- 
load may  be  varied;  and  the  test  workload  be  a representative  model 
of  the  real  workload  so  that  valid  conclusions  may  be  drawn  from  its 
use.  All  these  requirements  can  be  fulfilled  by  using  synthetic 
programs  in  the  test  workload. 

The  test  workload  used  in  the  experiments  is  constructed  by 
matching  the  joint  probability  density  for  the  selected  workload 
characteristics  of  the  real  workload  with  those  of  the  test  work- 
load. The  selected  characteristics  are  the  processor  time,  the  I/O 
activities  to  disk,  the  number  of  files  and  the  amount  of  core  used 
by  each  job.  These  characteristics  are  determined  from  the  LOGGER 
accounting  data  maintained  by  the  Master  Control  Program  (MCP-V)  of 
the  Burroughs  B 3500  system.  The  details  of  the  various  types  of 
records  collected  by  LOGGER  are  studied  and  special  conversion 
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packages  are  implemented  to  create  a job  summary  record  for  every 
job  processed.  These  summary  records  are  used  to  determine  the  test 
workload  characteristics . 

In  these  experiments  the  utilization  of  the  processor  and  the 
channel  is  measured  using  a DYNAPROBE  7900^  hardware  monitor.  These 
utilization  values  are  also  calculated  using  the  LOGGER  data.  The 
two  sets  of  values  compare  favorably.  This  is  a significant  finding 
of  this  investigation.  The  utilization  values  of  the  system  resources 
are  essential  in  the  evaluation  of  the  computer  performance.  This 
study  indicates  that  the  accounting  data  (LOGGER)  provides  a ready 
and  economical  source  for  determining  the  resource  utilization.  A 
system  manager  can  conveniently  assess  the  system  performance  by 
periodic  processing  of  the  accounting  data  to  determine  the  resource 
utilization. 

The  test  workload  is  only  a model  of  the  real  workload  and  is 
constructed  by  striking  a compromise  between  the  needs  of  representa- 
tiveness and  physical  constraints  such  as  the  processor  time  used. 

Such  a compromise  necessarily  renders  the  synthetic  workload  not  com- 
pletely representative  of  all  the  features  of  the  real  workload.  This 
is  indeed  the  case  in  our  study  where  the  test  workload  is  constructed 
using  only  four  of  the  many  characteristics  recorded  by  the  accounting 
data.  The  tape  I/O  activities  are  not  represented  because  it  was 
decided  not  to  consider  the  human  factors  involved  in  the  tape  handling. 
Special  measurements  have  to  be  made  to  include  the  tape  I/O  activi- 
ties in  the  test  workload.  These  measurements  relate  to  the  method 
of  handling  tapes. 

The  construction  of  the  test  workload  is  outlined  in  Section  II. 
Section  III  describes  the  experiments.  The  report  is  summarized  in 
Section  IV  and  the  method  of  calculating  the  resource  utilization  is 
described  in  the  Appendix. 

*Comress,  Incorporated. 
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SECTION  II 


CONSTRUCTION  OF  TEST  WORKLOAD 


GENERAL  DESCRIPTION 

A workload  is  defined  as  the  collection  of  all  the  individual 
jobs  that  are  processed  by  a computer  system  during  a specified 
period  of  time.  The  computer  system  can  be  considered  as  a collec- 
tion of  resources  upon  which  the  workload  places  certain  demands. 

The  magnitude  of  these  demands  can  be  viewed  as  the  characteristic 
variables  of  the  real  workload.  A job,  such  as  a compilation,  matrix 
inversion,  or  sort-merge  can  be  described  by  a set  of  these  variables 
whose  magnitudes  will  vary  from  one  type  of  job  to  another,  and  from 
one  computer  system  to  another. 

In  this  method  of  characterizing  the  workload  it  is  assumed  that 
since  the  system  does  not  recognize  the  type  of  job,  two  jobs  reflec- 
ting the  same  value  for  the  characteristic  variables  are  treated 
identically  by  the  operating  system.  This  assumption  is  reasonable 
since  the  operating  system  classifies  jobs  on  the  basis  of  the 
demands  they  place  on  the  system  resources. 

Many  computer  installations  maintain  a system  accounting  package 
that,  for  charging  purposes,  gathers  information  about  the  use  of 
various  system  resources.  This  information  provides  a ready  source 
of  data  from  which  many  of  the  workload  characteristics  can  be 
derived.  To  determine  the  characteristics  not  available  from  the 
accounting  data,  hardware  and  software  monitors  must  be  used. 

In  many  installations  a significant  part  of  the  I/O  activities 
may  be  directed  to  tape.  This  was  the  case  in  this  study  where  the 
workload  of  the  B 3500  (ESD,  Hanscom  Air  Force  Base)  was  studied.  In 
such  cases  the  accounting  data  seldom  record  the  details  of  the  I/O  acti- 
vities to  tape.  Examples  of  these  details  are:  number  of  tape  mounts 
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and  dismounts,  time  per  mount,  number  of  jobs  requesting  the  same 
tape  drive,  and  forced  idle  time  for  a job  because  of  conflicts  in 
tape  drive  availability.  These  details  are  significant  and  contribute 
to  the  overall  performance  of  the  computer  system.  Although  these 
details  are  system  dependent  they  can  best  be  described  as  human  fac- 
tors that  influence  the  throughput  of  the  workload.  Mounting  and 
dismounting  of  tapes  are  not  only  influenced  by  the  number  of  tapes 
and  the  location  of  the  tape  library  but  also  by  the  number  of  opera- 
tors available. 

The  method  of  characterizing  the  workload  described  in  this  report 
does  not  consider  the  human  factors.  This  study  used  the  total  number 
of  I/O  activities  to  tape  or  to  disk  as  a measure  of  the  I/O  load. 

This  quantity,  admittedly,  neglects  the  time  for  mount  and  dismount 
and  this  is  a limitation  in  this  method  of  characterization. 

The  tape  I/O  can  be  accounted  for  by  creating  tape  files  in  the 
synthetic  program.  Initial  calibration  experiments  must  be  conducted 
to  relate  the  tape  block  counts  (the  number  of  blocks  read  or  written 
to  tape)  with  the  corresponding  synthetic  program  parameters.  In  the 
case  of  tape  files,  not  all  the  block  counts  (recorded  by  the  account- 
ing package)  lead  to  data  transfer;  some  of  them  result  in  tape 
positioning,  label  checking  and  tape  rewinding.  The  fraction  of  the 
total  block  count  resulting  in  data  transfer  can  be  determined  in  the 
preliminary  calibration  experiments.  The  tape  handling  in  the  real 
workload  should  then  be  studied  to  estimate  the  time  taken  for  res- 
ponding to  the  mount  request  and  the  number  of  tape  files  processed 
by  individual  jobs.  Based  on  these  studies  it  is  possible  to  execute 
the  synthetic  program  creating  tape  files  and  introduce  known  amounts 
of  delay  to  simulate  tape  handling. 
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PREPROCESSING  OF  SYSTEM  LOG  FILE  DATA 
LOGGER  Data 


The  LOGGER  provided  the  source  of  performance  data  for  this 
report.  LOGGER  is  the  accounting  package  provided  as  a part  of  the 
MCP  operating  system  for  the  B 3500.  LOGGER  records  the  following 
event-oriented  statistics.  They  are  usually  referred  to  as  the 
Type  X records  where  X is  any  one  of  the  following. 


Type 

0 

File  Close  Record 

Type 

1 

File  Open  Record 

Type 

2 

End  of  Job  Record 

Type 

3 

Long  Schedule  Record 

Type 

4 

Short  Schedule  Record 

Type 

5 

Comment  Record 

% 

Type 

6 

Beginning  of  Job  Record 

Type 

7 

Idle  Time  Record 

9 

Type 

8 

Halt /load  Record 

Type 

9 

Filler  Record 

Data  Structure 

The  accounting  package  collects  and  records  data  in  the  event- 
oriented  format  as  and  when  the  events  take  place.  Typically  they 
are:  job  A begins;  job  A opens  file  x;  job  B begins;  job  A closes 

file  x;  etc.  They  are  also  referred  to  as  the  raw  data.  These  data 
are  uniquely  identified  by  the  job-log  number  (or  the  job  ID)  and 
using  this  field  as  the  key  the  raw  data  are  summarized  into  job- 
oriented  format,  also  referred  to  as  the  summary  data.  The  job- 
oriented  data,  essentially,  are  the  magnitudes  of  the  demands  placed 
on  the  various  system  resources  by  the  individual  jobs.  It  should 
be  noted  that  one  job-oriented  record  is  obtained  by  summarizing 
several  event-oriented  records. 
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The  performance  variables  of  interest  for  this  workload  charac- 
terization were  taken  from  four  of  the  LOGGER  record  types:  Type  0, 

File  Close  Record;  Type  1,  File  Open  Record;  Type  2,  End  of  Job 
Record;  and  Type  6,  Beginning  of  Job  Record.  The  record  layouts  for 
each  of  these  types  are  shown  in  Figures  1 and  2.^ 

The  standard  internal  character  set  for  the  B 2500/B  3500  is  the 
8-bit  Extended  Binary  Coded  Decimal  Interchange  Code  (EBCDIC) . In- 
ternal storage  for  the  accounting  data  is  organized  into  4-bit  digits 
and  is  processed  under  two  formats:  unsigned  4-bit  numeric  and  8-bit 

alphanumeric.  Most  of  the  data  uses  the  unsigned  4-bit  numeric  format 

[2  ] 

which  is  the  Burroughs  structure  for  high  density  storage  of  data. 

The  voluminous  quantity  of  event-oriented  records  on  system  activity 
must  be  summarized  into  usable  job-oriented  summary  records. 

An  existing  program  written  in  PL/I  for  the  IBM  370/155  (MITRE, 

Bedford)  was  modified  and  used  to  create  job  summary  records. 

It  was  necessary,  therefore,  to  convert  the  Burrough’s  LOGGER 
data  into  IBM  compatible  data  types.  There  is  no  conversion 
necessary  for  the  8-bit  alphanumeric  data  but  the  unsigned  4-bit  numeric 
must  be  changed  by  adding  a sign  and  aligning  digits  on  8-bit  (character) 
boundaries . 

ACCESSIBILITY 

The  event-oriented  data  collected  by  the  LOGGER  is  readily  acces- 
sible. The  data  set  receiving  this  raw  data  is  normally  a disk  file. 

Log  procedures  of  the  B 3500  accounting  system  store  information  into 
one  of  three  disk  data  sets  called  System  Log  Files.  The  raw  data 
of  interest  in  this  analysis  is  in  the  system  run  (current)  log 
file  and  is  named  RLOG.  The  disk  files  are  periodically  emptied 
onto  tape  volumes  for  storage.  The  RLOG  data,  consequently,  is  avail- 
able from  either  the  active  disk  file  or  the  stored  tape  file. 
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CLOSE 


OPEN 


Position 

Contents 

Position 

Contents 

0 

Type  Code 

0 

Type  Code 

1-4 

Reserved 

1-4 

Reserved 

5-8 

Log  ID  Number 

5-8 

Log  ID  Number 

9-14 

Date  (MMDDYY) 

9-14 

Date  (MMDDYY) 

15-22 

Time  (msecs) 

15-22 

Time  (msecs) 

23 

Subtype 

23 

Subtype 

24-35 

File  ID 

24-35 

File  ID 

36-47 

Multi-File  ID 

36-47 

Multi-File  ID 

48-49 

File  Number 

48-49 

File  Number 

50-51 

Primary  I/O  Channel 

50-51 

Primary  I/O  Channel 

52 

Unit  Number 

52 

Unit  Number 

53-54 

Hardware  Type  Used 

53-54 

Hardware  Type  Requested 

55 

Supplementary  Hardware  Code 

55 

Supplementary  Hardware  Code 

56-58 

Reel  Number 

56-58 

Reel  Number 

59-63 

Physical  Tape  Number 

59-63 

Creation  Date 

64-65 

Close  Type  Code 

64-65 

Cycle  Number 

66-73 

Logical  Record  Count 

66-70 

Maximum  Record  Length 

74-81 

Physical  Record  Count 

71-73 

Records  per  Block 

82-84 

Exception  Count 

74-79 

Maximum  Block  Size 

85-86 

Number  of  Disk  Areas  Used 

80 

Buffer  Access  Technique 

87-94 

Disk  End-of-File  Pointer 

81 

File  Label  Convention 

95 

Memory  for  Disk  File  Headers 

82 

Number  of  Alternate  Areas 

96-99 

Reserved 

83 

OPEN  Type  Code 

84 

Recording  Mode 

85 

Blocking  Technique 

86 

Special  Forms  Indicator 

87-89 

Save  Factor 

90-96 

Disk  Segments  per  Area 

97 

Disk  Access  Technique 

98 

Disk  File  Header  Block  Count 

99 

Reserved 

Figure  1.  File  Close  and  Open  Records 
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END  BEGIN 


Position 

Contents 

Position 

Contents 

0 

Type  Code 

0 

Type  Code 

1-4 

Reserved 

1-4 

Reserved 

5-8 

Log  ID  Number 

5-8 

Log  ID  Number 

9-14 

Data  (MMDDYY) 

9-14 

Date  (MMDDYY) 

15-22 

Time  (msecs) 

15-22 

Time  (msecs) 

23 

Subtype 

23 

Subtype 

24-35 

Program  ID 

24-35 

Program  ID 

36-47 

Multi-program  ID 

36-47 

Multi-program  ID 

48-49 

Job  Number 

48-49 

Job  Number 

50-51 

Primary  I/O  Channel 

50-55 

Disk  Segments  in  Program 

52 

Unit  Number 

56-61 

User  Charge  Number 

53-54 

Hardware  Type 

62-64 

Core  Required 

55 

Supplementary  Hardware  Type 

65-66 

Number  of  Files 

56-61 

Overlay  Count 

67-68 

Number  of  Disk  Files 

62-63 

Finish  Code 

69 

Execution  Code 

64-73 

Reserved 

70 

Reserved 

74-81 

Direct  Processor 

71 

Supplementary  Execution  < 

82-89 

Prorated  Processor  Time 

72 

MCP  Intrinsic  Flag 

90-97 

Accumulated  Program  I/O  Wait  Time 

73 

Reserved 

98-99 

Reserved 

74-79 

Date  Compiled 

80-99 

Reserved 

Figure  2.  End  of  Job  and  Beginning  of  Job  Records 
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COLLECTION  OF  JOB-ORIENTED  DATA 
Detailed  Format 


The  job  summary  record  in  Figure  3 consists  of  performance 
variables  of  interest  for  a workload  characterization.  A job  record 
to  be  used  for  another  purpose,  e.g.,  billing  reports,  would  probably 
embody  a significantly  different  set  of  information.  The  Log  ID 
Number,  Job  Start  Time,  and  Core  Required  fields  are  taken  from  BOJ 
record  type  6,  Figure  2.  EOJ  record  type  2,  Figure  2,  yields  Job 
End  Time,  Direct  Processor  Time,  Prorated  Processor  Time,  Hardware 
Type,  and  job  Log  ID.  Elapsed  Time  is  computed  using  EOJ  and  BOJ 
times  taken  from  type  2 and  type  6 records  respectively.  The  remain- 
ing fields  itemizing  files,  devices,  channels,  and  block  counts  are 
taken  from  the  file  close  records,  type  0,  Figure  1. 

Method 

Each  event-oriented  record  is  read  in  the  order  it  was  created 
and  checked  for  type  6,  the  BOJ  record.  A new  BOJ  record  defines  a 
unique  job  identifier  by  means  of  the  Log  ID  Number.  All  subsequent 

records  pertaining  to  a particular  job  are  found  by  using  this  iden- 
tifier. As  each  job  starts  it  is  added  to  the  mix  and  with  each  EOJ 
the  job  is  removed  by  updating  the  mix  count.  All  file  open  and 
close  records  for  a particular  job  are  processed  by  counting  the 
number  of  different  files,  devices,  and  channels  that  appear  and  by 
breaking  down  the  block  count  into  file,  device,  and  channel  sub- 
totals. The  EOJ  record  finishes  the  summary  cycle  for  a job.  The 
processor  time  is  extracted  from  this  final  record  and  the  completed 
job  summary  record  of  Figure  3 is  recorded.  Table  I shows  a partial 
list  of  summary  records  created  by  processing  one  month  of  raw 
accounting  data. 

WORKLOAD  CHARACTERISTICS 

The  system  dependent  characteristics  of  a month’s  workload  pro- 
cessed by  B 3500  were  analyzed  in  order  to  determine  the  test  workload 
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Position 


Length  (Bytes) 


0-2 

3-7 

8-12 

13-17 

18-22 

23-27 

28-32 

33-34 

35-39 

40-41 

42-43 

44-45 

46-V 

V 

V 

V 

V 

V 

V 

V 

V 


Data  Type 


3 PD 

5 PD 

5 PD 

5 PD 

5 PD 

5 PD 

5 PD 

2 PD 

5 PD 

2 B 

2 B 

2 B 

L x 6 CH 

M x 2 PD 

N x 2 PD 

M x 2 B 

N x 2 B 

N x 2 B 

L x 5 PD 

M x 5 PD 

N x 5 PD 


Contents 

Log  ID  Number 
Job  Start  (msecs) 

Job  End  Time  (msecs) 

Elapsed  Time  (msecs) 

Direct  Processor  Time  (msec) 
Prorated  Processor  Time  (msec) 
Total  Processor  Time  (msec) 
Core  Required  (kilo-bytes) 
Total  Block  Count 
Number  of  Files  (L) 

Number  of  Devices  (M) 

Number  of  Channels  (N) 

File  Names 
Device  Types 
Channels 

Files  per  Device 
Files  per  Channel 
Devices  per  Channel 
Blocks  per  File 
Blocks  per  Device 
Blocks  per  Channel 


Key:  PD  - Packed  Decimal 

B - Binary 

CH  - CHaracter 

L - Number  of  Files 

M - Number  of  Devices 

N - Number  of  Channels 

\ 

V - Variable 


Figure  3.  Job  Summary  Record  Used  in  Workload  Characterization 
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Table  I — Typical  B 3500  Job  Summary  Records  (Fixed  portion  only) 


characteristics.  Single  variable  histograms  were  constructed  to  iso- 
late the  major  variables.  The  following  histograms  were  studied: 


(i) 

Direct  Time 

(2) 

I/O  activities  to  disk 

(3) 

I/O  activities  to  tape 

(4) 

Core  used 

(5) 

Number  of  files 

(6) 

Number  of  channels 

As  an 

example,  the  histogram  of  the  direct  time 

is 

tabulated  in 

Table 

II.  Based  on  these  histograms  it  was  decided 

to  characterize 

a job 

by  the  following 

four  variables. 

(i) 

Direct  Time,  seconds 

(2) 

Number  of  block  counts 

to 

disk 

(3) 

Core  used 

(4) 

Number  of  files 

The  direct  time  is  the  demand  placed  on  the  CPU  and  is  the  processor 
time  in  the  user  state  or  the  normal  state  (or  the  problem  program 
state).  The  number  of  block  counts  is  the  number  of  blocks  of  data 
exchanged  between  main  storage  and  the  disk.  The  other  two  variables 
are  self-explanatory.  These  four  job-oriented  characteristics  were 
determined  from  the  derived  LOGGER  summary  data.  The  general  method 
of  the  statistical  analyses  to  determine  the  workload  characteristics 
appears  in  Reference  3. 

TEST  WORKLOAD  CHARACTERISTICS 

The  test  workload  characteristics  were  determined  by  matching 
the  four-dimensional  probability  density  of  the  test  workload  with 
that  of  the  real  workload.  The  details  of  this  method  appear  in 
Reference  4 and  will  be  briefly  presented  here. 
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Table  II  — Blatograa  of  Procaaaor 
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Table  II  — Hietogre*  of  Processor  Tl»e  (Continued) 


The  B 3500  workload  is  characterized  by  the  following  four 
variables. 


- Number  of  files 
“ Core  used 
X^  - Direct  time 

X,  - Number  of  block  counts  to  disk 
4 


A job,  therefore,  may  be  regarded  as  a point  in  the  multi-dimensional 
space,  with  the  co-ordinates  representing  each  of  the  four  demands. 
The  workload  then  becomes  an  ensemble  of  points  in  this  co-ordinate 
system. 


In  general,  if  X^,  X^,  X^,  and  X^  are  the  four  variables  used 
to  describe  the  workload  and  is  the  number  of  jobs  in  the  cell* 

with  co-ordinates  X^  = X^  = x^\  X^  = x^^\  X^  = then 

the  workload  can  be  described  as  a multi-variate  probability  density 
function  and  its  value  is  given  by: 


ijkl  N 


= Niikl 


tot 


i,  j , k,  1 ■ 1,  2,  3. . .L  (2.1) 


where  p ^ is  t^ie  probability  of  finding  a job  in  the  (i-j-k-l)-th 

cell  and  N is  the  total  number  of  jobs  in  the  workload.  L is  the 
tot  J 

number  of  cells  along  each  co-ordinate  axis. 

The  test  workload  is  constructed  by  matching  the  joint  probability 
density  of  the  four  variables  for  the  test  workload  with  that  of  the 


* The  magnitude  of  the  variable  corresponding  to  each  cell  is  taken 
to  be  its  value  at  the  mid-point  of  the  interval. 
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real  workload.  Denoting  the  test  workload  characteristics  by  primed 
quantities,  we  have 


N 


ijkl 


Pijkl 


N' 


tot 


(2.2) 


This  equation  can  be  physically  interpreted  as  matching  the  joint 
probability  density  of  the  test  workload  with  that  of  the  real  work- 
load. Constraints  on  the  total  number  of  jobs  in  the  test  workload 
or  on  the  total  CPU  time  for  processing  it,  can  be  imposed  in  the 
determination  of  In  this  study  the  constraint  was  placed  on 

the  total  CPU  time,  namely,  2000  seconds.  The  characteristics  of 
the  test  workload  are  derived  using  equation  (2.2)  and  they  are  tabu- 
lated  in  Table  III.  The  test  workload  consists  of  87  jobs. 


The  total  number  of  cells  used  in  the  determination  of  - 

ijkl 

and  g°verned  by  the  value  of  L,  or  the  number  of  cells  along 


each  co-ordinate  axis.  In  our  study  the  value  of  N 


tot 


7776. 


very  large  value  of  L will  result  in  a large  value  of  the  total  number 

4 4 

of  cells  (L  ) . As  seen  from  equation  (2.1)  there  are  L values  of 

Pijkl  (for  example’  Pmi’  Pm2 p2222 PLLLL)#  S±nCe  the 

total  number  of  jobs  (N  ^)  in  the  ensemble  is  fixed,  large  values 

of  L result  in  small  values  of  P^j^]/  It  should  be  pointed  out  that 
there  is  a rounding-off  performed  in  determining  the  from  equa- 

tion (2.2).  If  the  value  of  N! m . is  less  than  0.5  it  will  be  rounded- 

ijkl 

off  to  0.  A large  value  of  L will  tend  to  diffuse  the  distribution 
and  will  result  in  a multi-dimensional  picture  that  is  not  useful 
in  describing  the  workload.  Before  selecting  L the  single  variable 
histograms  should  be  studied  to  determine  the  values  of  the  variables 
that  correspond  to  the  most  frequently  occurring  jobs.  It  is  also 
not  necessary  to  choose  the  same  value  of  L along  all  the  co-ordinate 
axes.  The  number  and  location  of  the  cells  is  influenced  by  the  single 
variable  histograms.  Based  on  these  considerations  the  ensemble  in 
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Table  III 


Characteristics  of  the  Test  Workload 


Group 

Number  of 
Job,  0i1Jkl> 

Core 

(K-bytes) 

Number 
of  Files 

CPU 
(sec. ) 

Block 

Count 

1 

14 

4.5 

3 

2 

25 

2 

4 

4.5 

3 

4 

200 

3 

5 

4.5 

3 

5 

200 

4 

2 

4.5 

3 

9 

1000 

5 

2 

4.5 

3 

29 

1000 

6 

1 

4.5 

3 

73 

5000 

7 

1 

4.5 

6 

6 

200 

8 

1 

4.5 

6 

9 

1000 

9 

1 

4.5 

6 

31 

1000 

10 

1 

4.5 

6 

43 

5000 

11 

1 

6.5 

11 

33 

1000 

12 

1 

6.5 

11 

81 

5000 

13 

5 

17.5 

3 

3 

25 

14 

3 

17.5 

3 

6 

200 

15 

1 

17.5 

3 

9 

1000 

16 

3 

17.5 

3 

34 

1000 

17 

1 

17.5 

3 

78 

5000 

18 

1 

17.5 

6 

1 

25 

19 

2 

17.5 

6 

6 

25 

20 

4 

17.5 

6 

7 

200 

21 

1 

17.5 

6 

11 

1000 

22 

5 

17.5 

6 

31 

1000 

23 

2 

17.5 

6 

77 

5000 

24 

2 

18.0 

11 

7 

200 

25 

6 

18.0 

11 

34 

1000 

26 

1 

18.0 

11 

41 

5000 

27 

1 

18.0 

11 

75 

1000 

28 

6 

18.0 

11 

74 

5000 

29 

1 

29.5 

3 

4 

25 

30 

2 

29.5 

3 

30 

200 

31 

1 

29.5 

6 

8 

200 

32 

1 

29.5 

6 

30 

1000 

33 

1 

29.5 

6 

81 

5000 

34 

1 

30.0 

11 

29 

1000 

35 

2 

30.0 

11 

72 

5000 
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this  study  was  divided  into  144  cells  by  choosing  3 values  of  the 

number  of  files,  3 values  for  the  core  size,  4 values  for  the  direct 

time,  and  4 values  for  the  block  count  to  disk.  Table  IV  tabulates 

the  144  values  for  pJM1. 

ijkl 


SYNTHETIC  WORKLOAD 

The  eighty-seven  jobs  in  the  test  workload  are  realized  by 
using  a synthetic  program.  A COBOL  program  was  designed  and  imple- 
mented for  this  purpose.  It  is  not  practical  to  use  one  synthetic 
program  that  covers  a large  range  of  variation  in  core  size  and 
number  of  files;  further  such  a synthetic  program  will  not  be  flex- 
ible. It  is  simple  and  easy  to  implement  several  synthetic  programs 
to  represent  the  core  size  and  the  number  of  files  which  are  speci- 
fied as  compile  time  parameters.  The  four  variables  to  be  represented 
are  divided  into  two  groups.  They  are: 

Run  time  variables  - Direct  time  and  number  of  block  counts  to  disk. 

Compile  time  variables  - Core  size  and  number  of  files. 

Two  parameters  were  built  into  the  synthetic  program  for  varying  the 
direct  time  and  the  number  of  block  counts  to  disk.  There  are  nine 
combinations  of  core  size  and  the  number  of  files  and  nine  separate 
COBOL  programs  were  compiled  to  reflect  these  nine  combinations. 

The  nine  programs  were  stored  in  auxiliary  storage,  retrieved  at 
run  time  and  executed  using  the  run  time  parameters. 

CALIBRATION  AND  INVERSION 

The  nine  values  of  the  core  size  and  number  of  files  combina- 
tions were  made  equal  to  the  corresponding  values  of  the  test  work- 
load. This  eliminated  the  need  for  calibrating  the  synthetic  program 
for  these  two  parameters.  Each  one  of  the  nine  programs  has  two 
parameters  for  varying  the  direct  time  and  the  number  of  block  counts 
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Table  IV  — Joint  Probability  Distribution  of  the  Real  Workload,  For  example,  the  pair  of  numbers  (1056,  0.136)  for 
I * 4,  J * 3,  K ■ 1000,  L * 25  represent  and  p ^ respectively. 


COKE 

FILES 

CPU 


o 

* 

o 

o 

rg 

O 

o 

* 

o 

o 

o 

* 

O 

o 

J- 

o 

# 

o 

o 

O 

pal 

o 

* 

o 

o 

o 

<NI 

o 

* 

o 

o 

pH 

>0 

o 

* 

o 

o 

O 

o 

o 

* 

o 

o 

o 

o 

o 

# 

o 

o 

o 

o 

in 

* 

<» 

• 

• 

• 

in 

* 

• 

• 

• 

• 

in 

# 

• 

• 

• 

• 

# 

# 

* 

* 

# 

* 

* 

o 

o 

m 

c 

# 

o 

o 

n 

* 

o 

c 

O' 

pH 

* 

PH 

CO 

* 

in 

o 

* 

o 

o 

* 

* 

PH 

* 

pH 

in 

* 

* 

* 

# 

* 

* 

* 

* 

* 

o 

# 

o 

o 

.n 

in 

o 

* 

o 

u. 

n- 

o 

* 

o 

pH 

o 

o 

# 

o 

o 

r\j 

o 

o 

* 

o 

o 

sf 

o 

o 

* 

o 

u 

>o 

c 

o 

* 

o 

o 

o 

o 

o 

* 

o 

o 

o 

o 

o 

* 

o 

o 

o 

o 

pH 

* 

• 

• 

• 

• 

« 

• 

• 

• 

• 

pH 

* 

• 

• 

• 

• 

* 

* 

# 

* 

* 

* 

# 

o 

m 

<0 

* 

o 

c 

o 

* 

o 

st 

* 

r- 

O' 

* 

>c 

<0 

* 

m 

r- 

o 

* 

PH 

# 

PO 

# 

# 

* 

# 

* 

* 

* 

# 

« 

♦ 

o 

* 

iT 

>c 

o 

o 

# 

pH 

<\l 

4 

o 

c 

* 

o 

on 

(Nl 

o 

o 

* 

o 

ng 

o 

o 

o 

* 

o 

>#■ 

o 

o 

o 

* 

c 

pH 

c 

o 

r h 

* 

o 

o 

o 

o 

(M 

* 

CJ 

o 

o 

o 

rj 

# 

o 

o 

c 

c 

* 

• 

• 

• 

• 

* 

• 

• 

• 

• 

# 

• 

• 

• 

• 

* 

* 

* 

* 

• 

* 

* 

ir 

00 

(JO 

* 

o 

in 

O' 

pH 

# 

c; 

o 

n 

o 

* 

r' 

o 

* 

pH 

i\ 

Nf 

* 

pH 

* 

ph 

«■ 

m 

* 

pH 

« 

* 

« 

# 

* 

* 

« 

* 

# 

in 

* 

m 

nj 

o 

pH 

in 

* 

^a 

>0 

o 

o 

in 

« 

o 

PH 

o 

c 

<\l 

« 

in 

O 

o 

o 

nj 

# 

pH 

o 

o 

<\l 

* 

o 

c 

o 

r> 

* 

o 

o 

o 

o 

* 

o 

o 

o 

o 

* 

c> 

c- 

o 

o 

« 

• 

• 

• 

• 

* 

• 

• 

• 

• 

* 

• 

• 

• 

• 

* 

* 

* 

# 

« 

* 

« 

<\l 

r- 

m 

O 

* 

*n 

n 

o 

o 

# 

o 

pH 

o 

o 

« 

pH 

ph 

* 

or 

<N 

« 

pH 

# 

•4- 

* 

pH 

* 

* 

# 

* 

* 

* 

« 

« 

* 

« 

•J 

# # 

# 

* 

* 

* 

« 

* * 

# 

* 

» 

* * 

* 

* 

* 

» 

* 

* 

» 

# 

O 

o 

o 

u 

« 

o 

o 

o 

c 

# 

o 

o 

o 

o 

o 

e 

o 

o 

c 

o 

o 

c 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

m 

^0 

o 

<Nl 

*0 

o 

<Sg 

>0 

o 

r\j 

rf\ 

m 

r- 

( r . 

n- 

n- 

m 

h* 

•n 

h- 

ii 

»l 

ii 

ii 

li 

il 

►H 

PH 

*— 

n 

23 


Table  IV  — Joint  Probability  Distribution  of  the  Real  Workload  (Continued) 
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Table  XV  — Joint  Probability  Distribution  of  the  Real  Workload  (Continued) 


to  disk.  Seventy-five  calibration  runs  were  conducted  to  relate 
these  two  parameters  to  the  direct  time  and  the  block  count.  In  all 
these  calibration  runs  the  LOGGER  summary  data  was  used  to  determine 
the  block  count  and  the  direct  time.  Various  combinations  of  the 
four  parameters,  viz.,  core  size,  number  of  files,  and  the  parameters 
for  direct  time  and  the  block  count  were  used  in  these  calibration 
experiments.  Table  V presents  these  experimental  results. 

These  experimental  results  are  used  in  the  inversion  of  the 
test  workload  characteristics  into  synthetic  program  parameters. 

During  the  calibration  experiments,  the  synthetic  program  parameters 
are  varied  over  the  required  range  and  the  resulting  values  of  the 
direct  time  and  the  number  of  block  counts  are  obtained  from  the 
LOGGER  summary  data.  Inversion  consists  of  reversing  the  above  pro- 
cedure. In  other  words,  given  a set  of  workload  characteristics, 
(direct  time  and  the  number  of  block  counts)  the  required  values  of 
the  synthetic  program  parameters  are  determined  from  the  calibration 
results.  In  this  study  because  of  the  number  of  variables  involved, 
no  attempt  was  made  to  derive  general  expressions  relating  the  syn- 
thetic program  parameters  with  the  workload  characteristics.  Instead 
the  calibration  runs  were  conducted  with  the  values  of  the  parameters 
in  the  immediate  neighborhood  of  the  desired  workload  characteristics. 
The  experimental  results  were  used  to  determine  the  parameter  settings 
for  the  87  jobs  in  the  test  workload.  The  synthetic  workload  is  the 
collection  of  these  87  jobs. 

CLOSURE 

The  performance  of  a computer  system  can  be  described  in  terms 
of  the  interaction  between  the  workload  and  the  hardware-software 
configuration.  A method  of  studying  this  interaction  is  to  conduct 
experiments  with  the  existing  hardware-software  configuration  using 
a stabilized,  reproducible  workload  that  is  representative  of  the 
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Table  V 


Exper  Intent  a 1 Results  Used  in  Calibrating  the  Synthetic  Program 


Synthetic  Program  Parameters 


Job 

Compile  Time 

Run  Time 

Calibration  Results 

No. 

Core 

Files 

Times  thru 

Total  Block 

Core  (K-bytes) 

Files 

CPU  (sec) 

Block  Count 

(i) 

<J> 

CPU  loop  (k) 

Count  (1) 

(i) 

(J) 

(k) 

(1) 

1 

4 

3 

100 

25 

4.5 

3 

1 

25 

2 

4 

3 

100 

200 

4.5 

3 

3 

200 

3 

4 

3 

300 

1000 

4.5 

3 

11 

1000 

4 

4 

6 

1000 

200 

4.5 

6 

11 

200 

5 

4 

6 

800 

5000 

4.5 

6 

49 

5000 

6 

4 

11 

3000 

1000 

6.5 

11 

37 

1000 

7 

17 

3 

100 

25 

17.5 

3 

2 

25 

8 

17 

3 

300 

1000 

17.5 

3 

11 

1000 

9 

17 

3 

10000 

5000 

17.5 

3 

139 

5000 

10 

17 

6 

600 

25 

17.5 

6 

8 

25 

11 

17 

6 

500 

1000 

17.5 

6 

13 

1000 

12 

17 

6 

4000 

5000 

17.5 

6 

82 

5000 

13 

17 

11 

800 

200 

18.0 

11 

13 

200 

14 

17 

11 

1000 

5000 

18.0 

11 

41 

5000 

15 

17 

11 

4000 

5000 

18.0 

11 

89 

5000 

16 

29 

3 

100 

25 

29.5 

3 

6 

25 

17 

29 

3 

2500 

200 

29.5 

3 

28 

200 

18 

29 

3 

5000 

25 

29.5 

3 

46 

25 

19 

29 

6 

10 

200 

29.5 

6 

2 

200 

20 

29 

6 

800 

200 

29.5 

6 

12 

200 

21 

29 

6 

4000 

5000 

29.5 

6 

86 

5000 

22 

29 

11 

4000 

5000 

30.0 

11 

84 

5000 

23 

29 

11 

15000 

1000 

30.0 

11 

146 

1000 

24 

4 

3 

100 

25 

4.5 

3 

2 

25 

25 

4 

3 

50 

200 

4.5 

3 

4 

200 

26 

4 

3 

300 

200 

4.5 

3 

5 

200 

27 

4 

3 

50 

1000 

4.5 

3 

9 

1000 

28 

4 

3 

2500 

1000 

4.5 

3 

37 

1000 

29 

4 

3 

3200 

5000 

4.5 

3 

77 

5000 

30 

4 

6 

300 

200 

4.5 

6 

6 

200 

31 

4 

6 

50 

1000 

4.5 

6 

10 

1000 

32 

4 

6 

2500 

1000 

4.5 

6 

36 

1000 

33 

4 

6 

100 

5000 

4.5 

6 

45 

5000 

34 

4 

11 

2500 

1000 

6.5 

11 

38 

1000 

35 

4 

11 

3000 

5000 

6.5 

11 

72 

5000 

36 

17 

3 

70 

25 

17.5 

3 

1 

25 

37 

17 

3 

100 

200 

17.5 

3 

3 

200 
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Table  V 


Experimental  Resulta  Used  In  Calibrating  the  Synthetic  Program 
(Continued) 


Synthetic  Program  Parameters 

Calibration  Reaults 

Job 

Compile  Time 

Run  Time 

No. 

Core 

Filea 

Tiroes  thru 

Total  Block 

Core  (K-bytes^ 

) Files 

CPU  (aec) 

Block  Count 

(i) 

(J) 

CPU  loop  (k 

) Count  (1) 

(i) 

(J) 

00 

(1) 

38 

17 

3 

10 

1000 

17.5 

3 

10 

1000 

39 

17 

3 

2000 

1000 

17.5 

3 

32 

1000 

40 

17 

3 

4000 

5000 

17.5 

3 

76 

5000 

41 

17 

6 

10 

25 

17.5 

6 

2 

25 

42 

17 

6 

400 

25 

17.5 

6 

6 

25 

43 

17 

6 

100 

200 

17.5 

6 

4 

200 

44 

17 

6 

10 

1000 

17.5 

6 

9 

1000 

45 

17 

6 

2000 

1000 

17.5 

6 

30 

1000 

46 

17 

6 

3000 

5000 

17.5 

6 

72 

5000 

47 

17 

11 

100 

200 

18.0 

11 

5 

200 

48 

17 

11 

2300 

1000 

18.0 

11 

33 

1000 

49 

17 

11 

500 

5000 

18.0 

11 

47 

5000 

50 

17 

11 

6 500 

1000 

18.0 

11 

76 

1000 

51 

17 

11 

3000 

5000 

18.0 

11 

75 

5000 

52 

29 

3 

10 

25 

29.5 

3 

2 

25 

53 

29 

3 

2600 

200 

29.5 

3 

28 

200 

54 

29 

6 

400 

200 

29.5 

6 

7 

200 

55 

29 

6 

3000 

1000 

29.5 

6 

48 

1000 

56 

29 

6 

3000 

5000 

29.5 

6 

74 

5000 

57 

29 

11 

3000 

1000 

30.0 

11 

38 

1000 

58 

29 

11 

3000 

5000 

30.0 

11 

72 

5000 

59 

4 

3 

0 

200 

4.5 

3 

4 

200 

60 

4 

3 

0 

1000 

4.5 

3 

9 

1000 

61 

4 

3 

2000 

1000 

4.5 

3 

29 

1000 

62 

4 

3 

3000 

5000 

4.5 

3 

73 

5000 

63 

4 

6 

0 

1000 

4.5 

6 

9 

1000 

64 

4 

6 

2000 

1000 

4.5 

6 

31 

1000 

65 

4 

6 

0 

5000 

4.5 

6 

43 

5000 

66 

4 

11 

2000 

1000 

6.5 

11 

33 

1000 

67 

17 

3 

400 

200 

17.5 

3 

6 

200 

68 

17 

3 

0 

1000 

17.5 

3 

9 

1000 

69 

17 

6 

350 

200 

17.5 

6 

7 

200 

70 

17 

6 

0 

1000 

17.5 

6 

11 

1000 

71 

17 

11 

300 

200 

18.0 

11 

7 

200 

72 

17 

11 

2000 

1000 

18.0 

11 

34 

1000 

73 

17 

11 

0 

5000 

18.0 

11 

41 

5000 

74 

29 

6 

2000 

1000 

29.5 

6 

30 

1000 

75 

29 

11 

2000 

1000 

30.0 

11 

29 

1000 
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real  workload.  That  there  is  a need  for  constructing  a representative 
test  workload  need  not  be  overemphasized.  In  this  study  the  represen- 
tative workload  is  constructed  by  first  isolating  the  most  frequently 
occurring  jobs. 

The  accounting  package  seldom  captures  data  about  the  human  fact- 
ors involved,  e.g.,  time  for  mounting  and  dismounting  tapes.  Special 
measurements  have  to  be  made  to  determine  the  human  factors.  Typical 
measurements  are  the  number  of  frequently  used  tapes,  and  the  method 
of  tape  assignments. 

The  following  procedure  may  be  adopted  for  evaluating  the  perfor- 
mance of  a computer  system  in  which  tape  files  dominate.  The  workload 
can  be  divided  into  two  distinct  classes  - jobs  with  disk  I/O  only 
and  jobs  with  tape  and/or  disk  I/O.  The  former  class  can  be  represented 
using  methods  described  in  this  section.  The  latter  class  can  be  rep- 
resented with  the  use  of  synthetic  programs  that  create  tape  files. 

The  combined  representative  synthetic  workload  can  then  be  used  to 
study  the  effects  of  system  parameters;  for  example,  changes  in  the 
configuration,  addition  to  the  existing  configuration,  blocking  factor 
etc.  In  these  experiments  care  should  be  taken  to  insure  that  the 
human  factors  are  adequately  simulated  by  introducing  known  amounts 
of  delay  in  mounting  and  dismounting  tapes. 
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SECTION  III 


EXPERIMENTS  WITH  THE  TEST  WORKLOAD 


PURPOSE 

A computer  system  can  be  considered  as  a hardware-software- 
configuration  (HSC)  interacting  with  its  workload.  Evaluation  of 
computer  system  performance  requires  an  understanding  of  this  inter- 
action in  order  to  assess  and  improve  the  performance  and  to  predict 
the  effects  of  changes  in  either  the  workload  or  the  HSC. 

In  this  study,  experiments  were  conducted  with  the  B 3500  using 
a representative  model  of  the  real  workload.  The  controlled  experi- 
ments were  conducted  in  a dedicated  environment.  A hardware  monitor 
(DYNAPROBE)  was  used  to  measure  the  utilization  of  the  processor,  the 
channel  and  the  physical  devices. 

EXPERIMENTAL  RESULTS 

The  test  workload,  representative  of  the  batch  jobs  processed 
during  a month  on  the  B 3500  (ESD,  Hanscom  Air  Force  Base)  consisted 
of  87  jobs.  Their  CPU  time,  core  used,  number  of  files  and  block  count 
to  disk  are  presented  in  Table  III.  LOGGER  data  was  used  to  obtain 
the  following  characteristics  for  each  job. 

a)  Start  Time 

b)  Stop  Time 

c)  Direct  time  used 

d)  Core  requested 

e)  Number  of  files 

f)  Block  count  per  file 
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A hardware  monitor,  DYNAPROBE  7900,  was  used  to  measure  the 
following  quantities. 

a)  CPU  busy  in  the  normal  state 

b)  CPU  idle 

c)  Disk  channel  2 busy  (primary) 

d)  Disk  channel  10  busy  (alternate) 

e)  Disk  channels  2 and  10  busy 

Figure  4 is  a schematic  of  the  hardware  configuration  used  in  these 
experiments.  The  electrical  signals  corresponding  to  the  above  quan- 
tities were  recorded  on  tape  which  was  later  analyzed  to  determine 
the  utilization  values. 

The  architecture  of  the  B 3500  was  initially  studied  to  determine 
the  nature  of  the  experiments  to  be  performed.  In  this  system  there 
are  six  disk  units  connected  to  the  processor  by  a pair  of  channels 
(channel  2 and  10)  to  achieve  simultaneity.  The  disk  units  are  head- 
per-track  units.  As  there  is  only  one  primary  channel  (channel  2) 
for  all  the  disk  units,  overlap  between  data  transfers  can  be  accom- 
plished by  forcing  the  transfer  to  take  place  through  the  secondary 
channel  (channel  10) . Because  of  this  channel  - disk  unit  relation- 
ship it  is  not  possible  to  initiate  many  data  transfers,  analogous 
to  parallel  seeks  characteristic  of  moving  arm  disk  units. 

The  degree  of  multiprogramming,  the  number  of  jobs  co-resident 
in  main  memory,  is  determined  mainly  by  the  core  size  of  the  available 
jobs  in  the  mix.  In  the  experiments  reported  here,  all  the  jobs 
were  initially  spooled  to  the  disk.  The  first  set  of  jobs  were  ini- 
tiated. The  number  of  jobs  in  this  set  was  determined  by  the  indivi- 
dual core  sizes.  At  the  termination  of  a job,  the  operating  system 
scans  the  list  of  available  jobs  and  initiates  a job  which  can  fit 
into  the  vacant  core  space.  If  such  a job  cannot  be  found  the  opera- 
ting system  waits  for  a second  job  to  terminate  and  the  search  for 
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Figure  4 B-3500  H A R D W A R E CON  F I GU  R ATI  0 N 
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the  next  job  is  repeated  with  the  difference  that  the  vacant  space 
now  available  has  increased.  For  this  reason  the  degree  of  multi- 
programming varies  during  a session.  The  multiprogramming  (number  of 
jobs  in  the  mix)  resulting  from  one  of  the  experimental  sessions  is 
shown  in  Figure  5. 

Eight  runs  were  conducted  using  the  test  workload.  The  effect 
of  external  sequencing  (i.e.,  the  order  in  which  the  jobs  are  pro- 
cessed by  the  system)  on  the  overall  performance  of  the  computer 
system  was  studied.  The  first  experiment  was  run  single-thread  to 
verify  the  characteristics  of  the  test  workload.  The  second  experi- 
ment was  run  with  multiprogramming  in  which  the  jobs  were  initiated 
from  the  operator’s  console  after  every  job  termination.  This  was 
found  to  be  inefficient  as  the  system  was  idling  waiting  for  jobs 
to  be  initiated.  In  the  third  experiment  the  job  initiating  was 
accomplished  automatically  by  the  operating  system  with  no  operator 
intervention.  It  was  found  that  changes  in  the  external  sequencing 
have  no  appreciable  effect  on  the  overall  performance.  This  is  not 
very  suprising  in  view  of  the  fact  that  the  workload  is  CPU-bound. 

In  single  processor,  mu It ip ro gramme d systems,  CPU-bound  workloads 
lead  to  a situation  in  which  all  the  jobs  have  to  wait  to  use  the 
processor,  since  the  overlap  between  the  processor  and  I/O  usage  is 
very  small. 

The  test  workload  used  only  disk  files  and  the  I/O  activities  were, 
therefore,  much  faster  than  the  equivalent  I/O  activities  to  tape.  In 
effect,  the  I/O  load  on  the  system  was  reduced  and  the  test  workload 
was  made  CPU-bound.  This  is  confirmed  by  the  fact  that  overall  CPU  uti- 
lization of  the  month’s  workload  studied  was  approximately  25%  and 
the  CPU  utilization  of  the  test  workload  was  approximately  65%.  It 
was  necessary  to  adjust  the  I/O  load  on  the  system  so  that  the  CPU 
utilization  became  reasonable. 
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The  synthetic  workload  used  allows  for  adjustment  of  the  I/O  load 
by  means  of  the  I/O  run  time  program  parameter.  The  assumption  is  made 
that  increasing  the  I/O  load  will  bring  the  CPU  utilization  down  to  a 
value  approaching  that  observed  for  the  real  workload.  This  implies 
that  the  neglected  tape  I/O  activities  can  be  replaced  by  a suitable 
number  of  disk  I/O  activities.  The  I/O  control  parameter  for  all  the 
jobs  in  the  test  workload  was  increased  by  a factor  of  5.  The  result 
was  that  the  elapsed  time  of  the  experiment  became  too  large.  Only  36 
of  the  total  of  87  jobs  were  completed  in  slightly  more  than  2 hours. 

The  number  of  block  counts  was  then  increased  by  a factor  of  2 and  the 
resulting  processor  utilization  was  found  to  be  45.8%.  In  both  these 
experiments  it  was  found  that  replacing  the  tape  I/O  with  disk  I/O  led 
to  large  values  of  channel  utilization.  In  trying  to  decrease  the  CPU 
utilization  to  realistic  values  we  were  in  effect  increasing  the  channel 
utilization  beyond  the  realistic  values.  In  real  life  situations  the 
tape  drives  and  disk  drives  are  connected  through  separate  channels  and 
it  is  not  possible  to  study  the  behaviour  of  such  a system  by  using  a 
test  workload  that  places  the  total  I/O  load  on  any  one  of  the  channels. 
The  overall  summary  of  experimental  results  is  shown  in  Table  VI. 

HARDWARE  MONITORING 

A hardware  monitor,  DYNAPROBE  7900*,  is  used  to  measure  the 
resource  utilization  of  the  major  resources  during  the  controlled 
experiments.  The  hardware  monitor  is  a high  impedance  meter  that 
measures  the  electrical  signals  that  correspond  to  the  busy  or  the 
idle  state  of  a given  resource.  The  hardware  monitor,  because  of 
its  high  impedance,  does  not  perturb  the  system  being  measured.  This 
is  a decided  advantage  over  software  monitors  which  influence  the 
performance  of  the  system  being  measured.  The  hardware  monitor 


*Comress , Incorporated . 


Table  VI 


Overall  Summary  of  the  Experimental  Results 


Experiment 

Number 


Elapsed 
Time  (sec.) 


Processor 
Time  (sec.) 


Total  Block 
Count 


CPU 

Utilization 


Remarks 


1 


6896 


1794 


110,775 


26.0% 


87  jobs,  single  thread, 
structured  schedule. 


2 


3344 


1994 


110,750 


59.6% 


86  jobs,  mult  1-program- 
ming by  executing  jobs 
from  operator’s  console; 
structured  schedule. 


3 


4 


3025 


3682 


2034 


2071 


110,750 


110,775 


67.2%  86  jobs,  multi-program- 

ming with  MCP  scheduling 
and  executing  the  jobs; 
structured  schedule. 

56.2%  87  jobs,  multi-program- 

ming (same  as  Exp.  3), 
unstructured  schedule. 


5 


6 


7 


7726 


5825 


6396 


2669 


2666 


2937 


216,733 


202,100 


221,550 


34.5% 


45.8% 


45.9% 


36  jobs,  8 partial  jobs, 
multi-programming  (same 
as  Exp.  3),  unstructured 
schedule,  increased  1/0 
by  factor  of  5. 

74  jobs,  incomplete  run, 
multi-programming  (same 
as  Exp.  3) , unstructured 
schedule,  doubled  1/0 
activity. 

87  jobs,  same  as  Exp.  6 


8 


5127 


2936 


221,550 


57.3% 


87  jobs,  multi-program- 
ming (same  as  Exp.  3), 
structured  schedule, 
doubled  1/0  activity. 
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captures  the  state  of  the  various  system  resources  continuously  and 
its  output  is  written  out  to  tape.  The  hardware  monitor  is  capable 
of  measuring  the  state  of  a large  number  of  system  resources  simul- 
taneously, by  means  of  hardware  probes  that  are  connected  to  the 
appropriate  pins  in  the  computer.  These  probe  signals  can  be  com- 
bined (logically)  to  determine  the  degree  of  overlap.  The  hardware 
monitor  output  is  collected  and  later  analyzed  by  means  of  special 
software  routines  to  determine  the  utilization  values. 

The  following  were  measured  using  the  hardware  monitor. 

1)  Processor  busy  in  the  normal  state 

2)  Processor  idle 

3)  Channel  2 busy 

4)  Channel  10  busy 

5)  Channel  2 and  channel  10  busy 

At  the  end  of  the  experiment  the  output  tape  is  analyzed  by 
a special  software  package.  The  output  of  this  program  appears  in 
Table  VII.  The  program  calculates  two  sets  of  utilization  for  the 
resources  indicated  in  the  table.  The  first  value  is  the  value  for 
the  interval.  The  second  value  is  the  cumulative  value  for  the 
duration  of  the  test  period.  The  cumulative  values  of  the  utiliza- 
tion are  used  for  the  purpose  of  comparison  with  the  corresponding 
values  calculated  from  the  accounting  data. 
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lAble  VII  — DYNAPROBE  HArdw*re  Monitor 


SECTION  IV 


SUMMARY 

Computer  performance  studies  can  be  classified  into  two  groups  - 
assess  and  improve  the  performance;  and  predict  the  effects  of 
changes  in  either  the  workload  or  the  hardware-sof tware  configuration. 
Efforts  in  this  area  are  greatly  helped  by  a proper  understanding  of 
the  interaction  between  the  workload  and  the  hardware-software-config- 
uration. This  report  presents  an  approach  that  may  be  useful  in 
understanding  this  interaction. 

Experiments  are  performed  with  a Burroughs  B 3500  computer  sys- 
tem (ESD,  Hanscom  Air  Force  Base)  using  a synthetic  test  workload,  in 
a dedicated  environment.  The  method  of  constructing  the  test  workload 
has  been  described.  The  method  consists  of  matching  the  joint  pro- 
bability density  of  the  real  workload  with  that  of  the  test  workload. 
Direct  time,  I/O  to  disk,  core  size  and  the  number  of  files  are  the 
four  characteristics  selected  for  representation.  The  magnitudes  of 
these  characteristics  are  derived  from  the  LOGGER  summary  data. 

In  these  controlled  experiments  the  resource  utilization  is 
monitored  using  a hardware  monitor.  The  values  of  the  utilization 
calculated  from  the  accounting  data  compare  favorably  with  those 
measured  by  the  monitor.  This  is  a very  significant  finding  of  this 
study.  This  study  has  shown  that  the  accounting  data  collected  by 
the  LOGGER  is  sufficient  to  calculate  the  processor  and  the  channel 
utilization  for  the  B 3500  computer  system.  LOGGER  provides  a ready 
and  economical  source  of  data  for  calculating  the  resource  utiliza- 
tion that  is  very  useful  in  analyzing  the  performance  of  the  B 3500 
system.  A system  manager  can  conveniently  assess  the  system  perform- 
ance by  periodic  processing  of  the  accounting  data  to  determine  the 
resource  utilization. 
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The  test  workload  used  in  these  experiments  does  not  include 
tape  I/O  activities.  In  installations  where  the  tape  I/O  activities 
dominate,  the  synthetic  workload  experiments  should  be  preceded  by 
an  analysis  of  the  human  factors  involved.  Some  examples  of  these 
human  factors  are  the  layout  of  the  tape  library,  the  number  of  tapes 
in  the  library  and  the  time  taken  for  mounting  and  dismounting  tapes. 
These  human  factors  have  a significant  influence  on  the  computer 
performance . 

The  tape  I/O  can  be  accounted  for  by  creating  tape  files  in  the 
synthetic  program.  The  workload  can  be  divided  into  two  distinct 
classes  - jobs  with  disk  I/O  only  and  jobs  with  tape  and/or  disk  I/O. 
The  combined  synthetic  workload  can  then  be  used  to  study  the  effects 
of  system  parameters;  for  example,  changes  in  the  configuration, 
addition  to  the  existing  configuration,  blocking  factor  etc. 

Representative,  synthetic  workloads  are  not  ends  in  themselves 
but  just  means  to  an  end.  They  can  be  used  as  stabilized,  reproducible 
workloads  in  conducting  experiments  with  an  existing  hardware-software- 
configuration  to  evaluate  the  performance  of  the  computer  system. 

The  synthetic  workloads  have  the  additional  advantage  that  they  are 
flexible.  By  varying  the  synthetic  program  parameters  the  character- 
istics of  the  test  workload  can  be  changed. 
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APPENDIX 


CALCULATION  OF  THE  RESOURCE  UTILIZATION 


The  utilization  of  the  processor  and  the  I/O  channels  was 
monitored  with  a hardware  monitor.  The  accounting  package,  LOGGER, 
collected  the  direct  time  for  each  job  and  the  total  number  of  block 
counts  transferred  by  each  job.  In  this  section  methods  for  calcu- 
lating the  resource  utilization  from  the  accounting  data  are  described. 
The  calculated  values  of  the  resource  utilization  are  then  compared 
with  those  measured  by  the  hardware  monitor. 

PROCESSOR 

The  direct  time  (d^)  of  every  job  in  the  synthetic  workload  is 
determined  from  the  LOGGER  data.  Let  the  elapsed  time  for  the  syn- 
thetic workload  be  T and  N be  the  total  number  of  jobs  in  the  workload. 
(N  = 87  in  our  study.)  The  processor  utilization  & ^ in  the  normal 
state  is  given 


The  processor  utilizations  for  the  seven  runs  were  calculated 
using  the  equation  (Al) . Table  VIII  presents  these  values  and  the 
corresponding  values  measured  by  the  hardware  monitor  are  also  pre- 
sented in  the  table  for  the  purposes  of  comparison.  The  two  sets 


of  values  agree  fairly  well.  The  LOGGER  itself  consumes  some  CPU 
time  but  it  does  not  measure  itself.  The  hardware  monitor  on  the 
other  hand,  measures  the  LOGGER  activity.  The  time  for  interrupt 
processing  is  usually  charged  to  the  interrupted  job  if  no  job  switching 
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takes  place.  The  LOGGER  collects  the  data  whenever  the  processor  is 
in  the  normal  state  and  does  not  collect  the  data  when  the  system  is 
in  the  master  state  (i.e.,  supervisory  state).  But  the  hardware 
monitor  does  collect  the  data  for  both  the  states.  These  consider- 
ations should  be  borne  in  mind  when  comparing  the  results  in  Table 
VIII. 

I/O  CHANNEL 

The  LOGGER  records  the  total  number  of  block  counts  to  each 
channel.  In  the  channel-device  architecture,  there  is  a primary 
channel  (channel  2)  and  a secondary  channel  (channel  10)  to  achieve 
simultaneity.  This  results  in  the  transfer  of  the  block  by  channel 
2.  Whenever  it  is  busy  channel  10  takes  over  the  transfer.  The 
LOGGER  does  not  distinguish  between  the  number  of  block  counts  trans- 
ferred to  the  primary  and  the  secondary  channel,  but  measures  the 
total  block  count  for  the  primary  and  the  secondary  combination. 

The  hardware  monitor,  on  the  other  hand,  distinguishes  between  the 
primary  and  the  secondary  channel  and  measures  the  utilization  of 
the  two  channels  separately. 

The  devices  used  in  the  B3500  system  are  head-per-track  disks 
and  the  data  transfer  takes  place  in  three  phases.  During  the  first 
phase  the  track  address  is  analyzed  and  the  corresponding  read/write 
head  is  connected  to  the  channel.  During  the  second  phase  the  read/ 
write  head  waits  for  the  beginning  of  the  track  to  rotate.  The 
second  phase  is  referred  to  as  the  latency.  During  the  third  phase 
the  data  transfer  takes  place.  It  should  be  pointed  out  that  the 
channel  and  the  device  are  busy  during  all  the  three  phases.  The 
seek  is  absent  in  the  transfer  process  since  no  arm  movement  is  in- 
volved . 
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The  time  per  block  count,  t , can  be  expressed  as 


t = (ave.  Latency)  + (Transfer  Time)  + (Overhead) 


(A2) 


where  the  average  latency  is  the  time  for  half  a revolution  of  the 
disk,  the  transfer  time  is  a function  of  the  block  size  and  the 
rate  of  transfer  and  the  overhead  includes  the  time  for  switching 
from  track  to  track.  The  hardware  characteristics  of  the  disk- 
channel  unit  are  given  below. 


The  block  size  for  all  the  files  in  the  test  workload  was  50  bytes. 
Based  on  these  values  the  average  time  per  block  count  is  given  by 

t = 23.0  + ~y  • + 1.8  ^ 23.0  + 0.13  + 1.8  — 25  milliseconds  (A3) 
The  channel  utilization  can  be  calculated  from  the  following  expression 


Type 


B9372-9 


Rotation  Speed 
Max.  Latency 
Ave.  Latency 
Rate  of  Transfer 
Overhead 


23  milliseconds 
377  kilobytes/second 
1.8  milliseconds 


46  milliseconds 


1300  RPM 


(t)  (B) 


(A4) 


T 


where 


- channel  utilization 

t - time  per  block  count  = 0.025  seconds 
B - total  block  count 

T - elapsed  time  for  the  test  workload 


43 


REFERENCES 


1-  ADPE  Performance  Management  System,  AFM  171-400,  Volume  III, 

1 December  1972. 

2.  Burroughs  B 2500  and  B 3500  Information  Processing  Systems  Assem- 
bler Reference  Manual,  PCN  1034949,  November  20,  1970. 

3.  J.  Esposito,  "Statistical  Analysis  to  Determine  Digital  Computer 
Workload  Characteristics,"  ESD-TR-74-175 , June  1974. 


4.  K.  Sreenivasan  and  A.  J.  Kleinman,  "Construction  and  Applications 
of  Representative  Synthetic  Workloads,"  ESD-TR-73-212 , August  1973. 


46 


